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Abstract 

Background: Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease consisting of emphysema, 
small airway obstruction, and/or chronic bronchitis that results in significant loss of lung function over time. 

Methods: In order to gain insights into the molecular pathways underlying progression of emphysema and 
explore computational strategies for identifying COPD therapeutics, we profiled gene expression in lung tissue 
samples obtained from regions within the same lung with varying amounts of emphysematous destruction from 
smokers with COPD (8 regions x 8 lungs = 64 samples). Regional emphysema severity was quantified in each 
tissue sample using the mean linear intercept (Lm) between alveolar walls from micro-CT scans. 

Results: We identified 127 genes whose expression levels were significantly associated with regional emphysema 
severity while controlling for gene expression differences between individuals. Genes increasing in expression with 
increasing emphysematous destruction included those involved in inflammation, such as the B-cell receptor 
signaling pathway, while genes decreasing in expression were enriched in tissue repair processes, including the 
transforming growth factor beta (TGFp) pathway, actin organization, and integrin signaling. We found concordant 
differential expression of these emphysema severity-associated genes in four cross-sectional studies of COPD. Using 
the Connectivity Map, we identified GHK as a compound that can reverse the gene-expression signature associated 
with emphysematous destruction and induce expression patterns consistent with TGFp pathway activation. 
Treatment of human fibroblasts with GHK recapitulated TGFp-induced gene-expression patterns, led to the 
organization of the actin cytoskeleton, and elevated the expression of integrin p1. Furthermore, addition of GHK or 
TGFp restored collagen I contraction and remodeling by fibroblasts derived from COPD lungs compared to 
fibroblasts from former smokers without COPD. 

Conclusions: These results demonstrate that gene-expression changes associated with regional emphysema 
severity within an individual's lung can provide insights into emphysema pathogenesis and identify novel 
therapeutic opportunities for this deadly disease. They also suggest the need for additional studies to examine the 
mechanisms by which TGFp and GHK each reverse the gene-expression signature of emphysematous destruction 
and the effects of this reversal on disease progression. 
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Background 

Chronic obstructive pulmonary disease (COPD) is a sig- 
nificant public health problem worldwide and the third 
leading cause of death in the United States [1]. It is char- 
acterized by irreversible airflow limitation due to obstruc- 
tion in the small conducting airways and emphysematous 
destruction of the gas exchanging tissue of the lung. 
Tobacco smoke is a significant risk factor for COPD and 
at least 25% of smokers develop this disease [2]. Current 
theories concerning disease pathogenesis include an 
imbalance between protease and anti-protease activity, 
induced apoptosis of alveolar cells through deregulation 
of pathways involved in oxidative stress, chronic inflam- 
mation, and aberrant tissue remodeling that lead to the 
destruction of the extracellular matrix (ECM) in the lung 
[3,4]. Lung repair and regeneration are potential pro- 
cesses to target with novel therapeutics in COPD as 
abnormal tissue repair by the epithelial-mesenchymal 
trophic unit can result in either fibrosis or destruction of 
the ECM [5]. However, the molecular mechanisms 
responsible for the pathogenesis of COPD remain poorly 
understood. 

Several groups have profiled gene expression in patients 
with and without COPD or between patients with varying 
levels of airflow obstruction in order to understand differ- 
ences in gene expression related to COPD [6-11]. While 
these studies have provided an initial look into the COPD 
transcriptome, their results primarily relied on the use of 
lung function tests to define the presence or degree of 
COPD. Lung function phenotypes can neither distinguish 
between obstruction in the small airways and emphysema- 
tous destruction of the lung parenchyma nor provide 
information about regional differences in disease severity. 
Recently, McDonough et al. used micro-CT scans to 
quantify the degree of emphysema in different regions of 
lungs from patients with severe COPD by measuring the 
mean linear intercept (Lm), a morphological measurement 
of alveolar destruction [12]. In order to gain insights into 
biological pathways associated with increasing emphysema 
severity within a patient and explore computational strate- 
gies for identifying COPD therapeutics, we obtained paired 
samples from eight regions at regular intervals between 
the apex and base of each explanted lung from six patients 
with severe COPD (Global Initiative for Chronic Obstruc- 
tive Lung Disease (GOLD) stage IV) and two donor lungs. 
The degree of emphysematous destruction was quantified 
in one tissue sample from each region by Lm, while gene 
expression was profiled in the adjacent tissue sample from 
the same region. 

We identified a number of genes whose expression is 
associated with increasing emphysematous destruction and 
found that pathways enriched among these genes were 
involved in the immune response and tissue remodeling. 
Using the Connectivity Map (CMap) [13], we found that 



the tripeptide Gly-His-Lys (GHK) was able to reverse the 
aberrant patterns of gene expression associated with 
increasing emphysema severity and induce patterns of gene 
expression consistent with transforming growth factor beta 
(TGF(3) pathway activation. Furthermore, we showed that 
by treating distal lung fibroblasts from COPD patients with 
GHK, we can restore normal contractile function through 
re-organization of the actin cytoskeleton and up-regulation 
of integrin-pi. These data further support the potential of 
GHK as a therapeutic in the treatment of emphysema. 

Materials and methods 

Sample acquisition and processing 

Single lungs (n = 6) were removed from patients treated 
for severe COPD by double lung transplantation at the 
University of Pennsylvania. Donor lungs (n = 2) for 
which no suitable recipient was identified were released 
for research use from the Gift of Life Organ Procure- 
ment Organization in Philadelphia. This study was 
approved by the institutional review boards and con- 
forms to the Helsinki Declaration. Written informed 
consent for use of these specimens and the relevant 
clinical and radiological data required for this research 
were obtained from each patient prior to surgery and 
from the next of kin of the persons whose donated lung 
was released for research. Each lung was removed from 
the thorax, cooled to 1.6°C, and transported to the 
laboratory where the bronchial stump was cannulated 
[14]. The lung was then inflated using a compressed air 
source attached to an underwater seal to slowly increase 
transpulmonary pressure (PL) from 0 to 30 cmH 2 0. 
The specimen was then held at a transpulmonary pres- 
sure of 10 cmH 2 Owhile frozen by liquid nitrogen vapor 
(-130°C). The frozen specimen had a multidetector CT 
scan followed by being cut into 2-cm thick slices in the 
same plane as the CT scan. Tissue samples were col- 
lected using a sharpened steel cylinder (cork bore dia- 
meter of 14 mm). One sample from a cluster of four 
core samples of lung obtained from each site was pro- 
cessed for micro-CT [12]. A companion core from the 
same cluster was used for the gene profiling and valida- 
tion studies reported here. The representative nature of 
these samples with respect to the entire lung was estab- 
lished by comparing the densities of the sampled sites 
with the frequency distribution of the densities in the 
entire lung on multidetector CT as reported in McDo- 
nough et al. [12]. 

Measurement of mean linear intercept 

The severity of emphysema within each core was esti- 
mated by measuring Lm. A micro-CT scan of each core 
provided approximately 1,000 contiguous 16-|im thick 
images. Lm was measured at 20 regularly spaced intervals 
of each of the micro-CT scans using a previously 
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validated grid of test lines projected onto the image and a 
custom macro linked to specialized software (ImagePro 
Plus; MediaCybernetics (Rockville, MD, USA). The num- 
ber of intercepts between these lines and tissue was 
counted. Lm was calculated as the total length of the test 
lines divided by the number of cross-overs with tissue 
(equal to the number of intercepts divided by 2). 

Microarray sample processing 

High molecular weight (mRNA-containing fraction) 
RNA was isolated from tissue cores using the miRNeasy 
Mini Kit (Qiagen, Valencia, CA, USA). RNA integrity 
was assessed using an Agilent 2100 Bioanalyzer and 
RNA purity was assessed using a NanoDrop spectro- 
photometer. RNA (1 \ig) was processed and hybridized 
onto the Human Exon 1.0 ST array (Affymetrix Inc., 
Santa Clara, CA, USA) according to the manufacturers 
protocol as previously described [15]. Expression Con- 
sole Version 1.1 (Affymetrix Inc.) was used to generate 
transcript-level gene expression estimates for the 'core' 
exon probesets via the robust multichip average (RMA) 
algorithm. Gene symbols of transcript IDs were 
retrieved using DAVID [16]. These gene expression data 
are available through the Gene Expression Omnibus 
(GEO) under the accession GSE27597. 

Microarray data analysis 

Two linear mixed-effects models were used to identify 
gene expression profiles associated with the degree of 
regional emphysema severity as measured by Lm: 

1. Geney = p 0 + Psiice x Slice;, + oij + e,j 

2. Geney = Po + Psiice x Slice^- + p Lm x Lm^ + Oj + Sy 
i= 1,2,..., 8; j = 1,2,..., 8 

£lj ~ N(0, a 2 )aj ~ N(0,^) 

Gene^ is the log 2 expression value for sample i in patient 
j for a single gene. Slice is a fixed effect controlling for the 
position within the lung from which the sample core was 
obtained. The random term s /; represents the random 
error, which was assumed to be normally distributed, a ; 
represents the random effect for patient, and p 0 represents 
the intercept. Model 2 contains an additional fixed effect 
term for emphysema severity measured by the natural log 
of Lm. A gene's expression profile was considered asso- 
ciated with Lm if model 2 fit better than model 1 as deter- 
mined by a significant P-value from a likelihood ratio test 
between the two models after applying a false discovery 
rate (FDR) correction. In the immunohistochemistry 
experiments, these linear models were also used to exam- 
ine the relationship between Lm and the volume fraction 
of tissue with positive staining by substituting volume frac- 
tion (Vv) for gene expression as a dependent variable. All 
statistical analyses were conducted using R statistical 



software v2.9.2 and the nlme package in Bioconductor 
v2.4 [17]. 

Functional enrichment analysis 

Functional enrichment analysis was performed using 
DAVID 2008 or Gene Set Enrichment Analysis (GSEA) 
v2.0.7 [16,18]. For DAVID, functional enrichment was 
examined among Gene Ontology categories, and KEGG 
and BIO CARTA pathways. All genes in the species Homo 
sapiens were used as a reference set. For GSEA, genes 
were ranked by the t-statistic of the p Lm coefficient in the 
linear mixed-effects model and then analyzed for the 
enrichment of canonical pathways and Gene Ontology 
term gene sets obtained from MSigDB v2.5. 

Connecting to other gene-expression datasets 

Using GSEA, sets of genes reported to change with 
COPD-related phenotypes or with TGFp treatment in 
other gene-expression studies were examined in a ranked 
list of genes ordered from most induced in severe emphy- 
sema to most repressed in severe emphysema by the 
t-statistic of the p Lm coefficient in the linear mixed- 
effects model. Conversely, sets of genes we identified as 
significantly positively or negatively associated with Lm 
were examined within gene lists ranked by the degree of 
differential expression as determined by re-analyzing pre- 
viously published COPD-or TGFp -related microarray 
studies. See Additional file 1 for a description of the data 
normalization procedures and statistical analyses used to 
generate gene sets and/or ranked gene lists for each of 
the previously published gene-expression datasets. 

Connectivity Map 

In order to find compounds that reverse gene-expression 
patterns associated with emphysema severity, we gener- 
ated separate signatures for each COPD or TGFp gene- 
expression dataset examined in this study. Signatures were 
generated by identifying the 50 genes most up-regulated 
and the 50 genes most down-regulated with respect to a 
COPD or TGFp -related phenotype. Each signature was 
queried against the CMap using the algorithm described 
by Lamb et al. [13]. See Additional file 1 for a description 
of the statistical analysis used to generate each query sig- 
nature for each phenotype within each dataset. The list of 
all CMap query signatures used in this analysis include: 1) 
genes that change in expression as a function of regional 
emphysema severity in this study; genes that change in 
expression with 2) forced expiratory volume in 1 second 
(FEVl), 3) FEVi/forced vital capacity (FVC), or 4) between 
cases versus controls in Bhattacharya et al [7]; genes that 
change in expression between 5) controls versus emphy- 
sema patients or between 6) controls versus a 1 -antitrypsin 
disease in Golpon et al. [6]; genes that change in expres- 
sion with 7) FEVi or 8) diffusing capacity of carbon 
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monoxide (DLCO) in Spira et al.[9]; genes that change in 
expression with 9) FEVi, 10) FEV^FVC, 11) DLCO, 12) 
non-smokers versus GOLD2, or 13) non-smokers versus 
GOLD3 in Wang et al.[10]; and genes that change in 
expression with TGFp treatment from 14) Qin et al.[19], 
15) Classen et a/. [20], 16) Renzoni et al.[21], 17) Koinuma 
et al.[22], and 18) Malizia et al.[23]. For comparison of the 
CMap data to our in vitro studies of the effects of GHK in 
primary lung fibroblasts, raw data for GHK-treated and 
control samples were downloaded from the CMap website 
and normalized using MAS5.0 with the Affymetrix CDF. 
Genes were ranked by a paired £-test between treatment 
and controls of different batches and compared to gene 
sets of GHK and TGFp treatment using GSEA. 

Isolation and culture of lung fibroblasts 

Lung tissue of former smokers (defined as quitting smok- 
ing for at least one year before surgery) with normal lung 
function or GOLD stage IV COPD was obtained from 
patients undergoing surgery for resection for pulmonary 
carcinoma or lung transplantation. Fibroblast cultures 
were established from parenchymal lung tissue by an 
explant technique as previously described [24]. Isolated 
cells were characterized as fibroblasts by morphological 
appearance and expression pattern of specific proteins as 
described previously [24,25]. Fibroblast cultures were 
stored into liquid nitrogen until use. 

Immunofluorescence 

Fibroblast cultures at passage 3 were cultured in eight- 
well chamber slides (Gibco, Burlington, ON, Canada) in 
growth medium (DMEM, 10% fetal bovine serum (FBS), 
penicillin, and streptavidin from Invitrogen, Burlington, 
ON, Canada). After reaching 70% confluence, fibroblasts 
were cultured for 24 h in 1% FBS DMEM and then incu- 
bated with either TGFpi lOng/ml (Peprotech, Dollard 
des Ormeaux, Quebec, Canada), GHK 10 nM (Sigma, 
Markham, Ontario, Canada) or control media (1% FBS 
DMEM, penicillin, streptavidin) for a further 48 h. After 
stimulations, chamber slides were fixed with 4% parafor- 
maldehyde for 20 minutes, blocked in 10% goat serum in 
phosphate-buffered saline (PBS) with 0.1% saponin for 
1 h and then stained with integrin-pi antibody (M-106, 
Santa Cruz Biotechnology, Santa Cruz, CA, USA) in 0.1% 
saponin in PBS for 2 h at room temperature. Following 
washing in PBS with 0.1% saponin and 0.1% Tween 20, 
secondary antibody conjugated with goat anti-Mouse IgG 
Alexa Fluor 488 and Phalloidin conjugated with Alexa 
Fluor 594 were incubated for 2 h at room temperature. 
Following final washes, cultures were incubated with 
DAPI 1 ng/ml and then coverslipped with cytoseal. 
Confocal images were acquired with a Leica AOBS SP2 
laser scanning confocal microscope (Leica, Heidelberg, 
Germany). The images were overlaid and the contrast 



enhancements were performed on the images using 
Volocity software™ (Improvisions Inc., Boston, MA, 
USA) as previously described [26]. 

Collagen gel contraction assays 

Fibroblast cultures at passage 3 were cultured in six-well 
tissue culture plates (Gibco, Canada) in growth medium 
(DMEM, 10% FBS, penicillin, and streptavidin from 
Invitrogen, Canada). After reaching 70% confluence, 
fibroblasts were cultured for 24 h in 1% FBS DMEM 
and then incubated with TGFpi 10 ng/ml (Peprotec, 
Canada), GHK 10 nM (Sigma, Canada) or growth media 
control for a further 48 h. Prior to the end of the treat- 
ment time point, a 12-well tissue culture plate was incu- 
bated with 1% bovine serum albumin in DMEM for 2 h. 
The medium was removed and then 500 \A of 0.4 mg/ml 
type I collagen (BD Biosciences, Mississauga, ON, 
Canada) was added and allowed to polymerize for 8 h at 
37°C. The treated fibroblasts were then trypsinized and 
seeded at 2 x 10 5 cells/500 ul of 1% FBS DMEM, penicil- 
lin, streptavidin in duplicate on the collagen gels and cul- 
tured for an additional 24 h at 37°C in 5% C0 2 . The gels 
were imaged before and after and the extent of gel con- 
traction measured using Image Pro Software. 

Multi-photon and second harmonic generation 
microscopy 

The collagen gels were fixed with 4% paraformaldehyde 
for 20 minutes and washed in PBS with 0.1% saponin 
and 0.1% Tween 20 before being incubated with phalloi- 
din conjugated with Alexa Fluor 594 for 1 h at room 
temp. Gels were then mounted on to a glass slide using 
Secure-seal™ imaging spacers (size 20 mm; Sigma) and 
aqueous mounting media. The gels were then imaged 
using second harmonic generation microscopy to deter- 
mine fibrilar collagen as previously described [27]. For 
each cell volume, Z-section images were compiled and 
the three-dimensional image restoration was performed 
using Volocity software (Improvisions, Inc.). A noise- 
removal filter with a kernel size of 3 x 3 was applied to 
these three-dimensional images. 

Results 

Study population 

Lm was quantified using micro-CT scans in eight sam- 
ples taken at regular intervals from apex to base of lungs 
from six subjects that required transplantation for COPD 
and two organ donors (Figure 1). Table 1 shows demo- 
graphic information and clinical characteristics of the 
eight subjects used in this study. As expected, samples 
from subjects with COPD had a higher mean and a 
greater range of Lm values between samples compared to 
those from donor lungs, indicating that there are regions 
of severe emphysema in COPD subjects (Table 1). 
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Donors COPD 
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(Low Lm) (High Lm) 



Sum length of line segments 

Lm = Total number of intercepts between 
line segments and the tissue 



Figure 1 Outline of study design, (a) Whole lungs were removed from patients with severe COPD and from donors, inflated with air, and 

rapidly frozen in liquid nitrogen vapor, (b) The frozen specimens were cut into 2-cm slices from apex to base of the lung, (c) Adjacent tissue 

cores were removed from 8 different slices of each lung (8 patients with 8 slices = 64 total regions), (d) Micro-CT was used to measure Lm at 20 

evenly spaced intervals throughout one core from each region. 
I J 



Table 1 Subject demographics for lung tissue samples 


Patient ID 


Description 


Sex 


Age 


Pack years 


Smoking status 


Lm mean ± SD (urn) 


Lm range (urn) 


6965 


COPD 


M 


62 


50 


Former 


716 ± 164 


494-982 


6967 


COPD 


F 


61 


25 


Former 


414 ± 82 


334-585 


6968 


COPD 


F 


63 


38 


Former 


724 ± 252 


357-1,013 


6969 


COPD a ' b 


F 


56 


54 


Former 


1,822 ± 1270 


521-4,620 


6970 


COPD c 


M 


55 


15 


Former 


1,352 ± 599 


647-2,551 


6971 


COPD 


M 


59 


30 


Former 


1,097 ± 441 


720-2,101 


6982 


Donor 


M 


59 




Never 


384 ± 47 


344-473 


6983 


Donor 


M 


62 


24 


Former 


289 ± 41 


231-352 



Subjects with COPD had FEV/FVC <70% and FEVt <25% predicted. a c Some patients had other diseases: a von Willebrand disease; b hypertension; c a1 -antitrypsin 
deficiency disease. 
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Subject 6967 was diagnosed with a pure airway obstruc- 
tion COPD phenotype without emphysema [28]. Consistent 
with this diagnosis, the distribution of Lm measurements 
for this patient closely resembles the distribution of Lm 
measurements from the donor lungs. Subject 6970 was 
diagnosed with a 1 -antitrypsin deficiency. The remaining 
four subjects with COPD had the centrilobular emphysema- 
tous phenotype commonly observed in smokers. The distri- 
bution of emphysematous destruction in tissue cores from 
these COPD patients range from little to no emphysema 
(Lm < 600) to very severe emphysema (Lm > 1,000) [12]. 
Subject 6969 had one sample excluded from subsequent 
analysis because its Lm measurement was an outlier (more 
than three times the interquartile range of the distribution 
of Lm measurements in all cores from all lungs examined). 

Pathways associated with regional emphysema severity 

Using linear mixed-effect models, the expression levels of 
127 genes were significantly associated with Lm and thus 
associated with regional emphysema severity (Figure 2a; 
FDR <0.10; see Additional file 2 for the analytic results 
for all genes). Using DAVID [16] or GSEA [18], we found 
that genes with functions in the B-cell receptor signaling 
pathway were over-represented among the up-regulated 
genes, while genes involved in cellular structure, integrin 
signaling, extracellular matrix production, focal adhesion, 
blood vessel morphogenesis, and the vascular endothelial 
growth factor and TGFp pathways were enriched among 
the down-regulated genes (FDR <0.05; see Additional 
file 3 for a list of all significantly enriched pathways). The 
expression of CD79A, a component of the B-cell recep- 
tor, increased in expression with increasing emphysema 
severity (Figure 2b), and the expression of ACVRL1 (also 
known as activin-like kinase I), a receptor in the TGFp 
pathway, decreased in expression with increasing emphy- 
sema severity (Figure 2c). These two genes are shown as 
examples of the characteristic relationship between Lm 
and gene expression as observed in Figure 2a. To predict 
transcription factors that might be responsible for the 
observed patterns of differential expression, we inferred a 
gene expression relevance network using the Context 
Likelihood of Relatedness (CLR) algorithm [29]. Tran- 
scription factors with the most connections to other 
genes included EPAS1 (also known as HIF-2a), KLF13, 
TALI, TBX3, GATA2, and BCL11A (Additional file 4). 
Fourteen genes whose expression is significantly corre- 
lated with regional emphysema severity or transcription 
factors that are highly connected to these genes in the 
relevance network were selected for quantitative RT-PCR 
validation in a subset of tissue cores from subjects with 
severe emphysema (see Additional file 1 for methods). 
Twelve out of the fourteen genes had a significant corre- 
lation between the expression values derived from the 
microarray and quantitative RT-PCR, showing that the 



association of gene expression with regional emphysema 
severity is reproducible across assays (Pearson correla- 
tion, P < 0.05; Additional file 5). 

In order to demonstrate that the 127 gene signature is 
related to regional emphysema severity within individuals 
and not to differences between donors and COPD patients 
or to differences in levels of emphysema between COPD 
patients, we repeated the same statistical analysis while only 
including the five COPD patients with emphysema and 
standardizing the Lm measurements within each patient 
core to a mean of zero and a standard deviation of one 
(Z-score). Using GSEA, the sets of up-and down-regulated 
genes in the 127-gene signature identified in the previous 
analysis with all eight patients and unsealed Lm measure- 
ments were concordantly enriched among genes differen- 
tially expressed when only the five emphysema patients 
were analyzed with Z-scored Lm measurements, indicating 
that this gene signature is associated with regional emphy- 
sema severity (FDR <0.001, GSEA; see Additional file 6 for 
the enrichment plot). 

Validation of pathways up-regulated in regions of severe 
emphysema 

In order to investigate whether the up-regulation of com- 
ponents of the B cell receptor signaling pathway is asso- 
ciated with a change in the quantity of B cells in lung 
tissue, we quantified the Vv of CD79A protein, a marker 
for B cells, in relation to Lm by immunohistochemistry 
(see Additional file 1 for methods). CD79A-positive B cells 
were observed in the alveolar and small airway wall tissue 
(Figure 3). Vv was quantified in alveolar tissue for all 64 
samples and in small airway tissue for 43 samples that con- 
tained small airways and was found to be positively corre- 
lated to Lm in both the alveolar and small airway wall 
tissue (P < 0.001), indicating that B cell abundance 
increases as emphysema severity increases. 

Validation of pathways down-regulated in regions of 
severe emphysema 

Several members of the TGFp pathway were among the 
genes that had decreased expression as a function of regio- 
nal emphysema severity. These genes included ACVRL1, 
ENG, TGFBR2, and SMAD6. Other components in this 
family, including BMPR2 (FDR q-value = 0.125) and 
SMAD7 (FDR q-value = 0.223), also showed evidence of 
modest down-regulation while SMAD1 showed evidence 
of modest up-regulation (FDR q-value = 0.101). To deter- 
mine whether the TGFp pathway might be affected by 
emphysema pathogenesis, we used seven previously pub- 
lished studies that had examined the effect of TGFp 
ligands on gene expression to develop a collection of sig- 
natures of TGFp pathway activation [19-23,30,31]. Genes 
that exhibited significantly decreased expression with 
increasing emphysema severity were enriched among 
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Figure 2 Gene expression signature of regional emphysema severity, (a) Supervised heatmap of genes whose expression is associated with 
Lm (FDR <0.10). Samples are organized from low to high Lm. Each row corresponds to a gene and each column corresponds to a sample. 
Green represents lower relative expression and red represents higher relative expression. (b,c) Expression of CD79A (b) and ACVRL1 (c) are 
plotted against the natural log of Lm with the color of each point indicating the subject from which the sample was derived. 



genes induced in response to TGFp treatment in a total of 
three datasets (FDR <0.05, GSEA). Similarly, the sets of 
genes most induced by TGFp from each of the seven data- 
sets examined were enriched among genes whose expres- 
sion decreased as a function of emphysema severity (FDR 
<0.05, GSEA). As an example, the enrichment of genes 
associated with emphysema severity among genes chan- 
ging with TGFp treatment in the dataset from Malizia 
et al. [23] is shown in Figure 4a,b. See Additional file 7 for 
the GSEA enrichment plots for all seven datasets. 

To further validate these findings, we cultured human 
lung fibroblasts with and without TGFpi and found that 
the set of genes most induced by TGFpi were enriched 
among genes that decrease in expression with increasing 
regional emphysema severity (FDR <0.05, GSEA; see Addi- 
tional file 8 for the GSEA enrichment plot and Additional 
file 1 for the fibroblast culture methods). Immunostaining 
of lung tissue from the same regions on which we 



performed gene expression analysis localized SMAD2, a 
down-stream signal transducer of TGFp, to the alveolar 
and airway walls while members of the bone morphoge- 
netic protein (BMP) pathway, including SMAD6 and 
SMAD1, were primarily seen in vascular endothelial cells 
(Additional file 9). 

Relationship to expression profiles in other COPD studies 

In order to show that the gene expression signature of 
regional emphysema severity is present in larger cohorts 
of patients with earlier stages of disease, we used GSEA to 
examine the relationship between genes associated with 
regional emphysema severity in this dataset and genes 
associated with COPD phenotypes in other cross-sectional 
studies [6-11]. The genes that decreased in expression 
with increasing emphysema severity were significantly 
enriched among genes down-regulated as a function of 
COPD-related phenotypes in four of the five previously 
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Figure 3 Validation of differential expression for CD79A by immunohistochemistry. Representative images of CD79A-positive cells (arrows) 
in the alveolar tissue and the small airway walls. Positive staining appears red. Scale = 200 urn; inset = 10 urn. 



published datasets that we examined (FDR <0.05, GSEA; 
Additional file 10). In addition, genes that increased in 
expression with increasing emphysema severity were 
enriched amongst genes up-regulated as a function of 
COPD-related phenotypes in three of the five datasets 
(FDR <0.05, GSEA). As an example, the enrichment of 
genes associated with regional emphysema severity among 
genes differentially expressed with the presence of emphy- 
sema in the dataset from Golpon et al.[6] is shown in 
Figure 4c,d. Conversely, sets of genes reported to be differ- 
entially expressed with COPD in four of the six other 
cross-sectional studies were enriched among the genes 
changing in expression with increasing regional emphy- 
sema severity (FDR <0.05, GSEA; Additional file 10). 
Examples of genes validated by quantitative RT-PCR in 
this study and concordantly differentially expressed in 
other studies are shown in Additional file 11. Many of 
these datasets, such as Bhattacharya et al.[7] and Wang 
et al. [10], contained larger numbers of patients with a vari- 
ety of stages of disease (for example, GOLD stage 0 
through GOLD stage IV). The enrichment of genes asso- 
ciated with regional emphysema severity with COPD- 
related phenotypes in these other datasets suggests that 



the biological processes associated with increasing emphy- 
sema severity within a patient with severe COPD also vary 
in individuals with earlier stages of disease. 

Prediction of novel therapeutics for emphysema 

In order to identify compounds that might reverse the 
gene-expression pattern associated with progression of 
emphysema, we utilized the CMap [13], a compendium of 
microarray experiments that measure the effect of thera- 
peutic compounds on gene expression in cancer cell lines. 
Signatures of genes that 1) change in expression with 
regional emphysema severity in this dataset, 2) change in 
expression with lung function measures in other datasets 
[6,7,9,10], or 3) change in expression with TGFp treatment 
in other datasets [19-23] were each used as separate 
queries into the CMap data. We found that gene expres- 
sion changes resulting from treatment with the tripeptide 
GHK, a compound thought to accelerate wound healing 
[32,33], were negatively correlated with expression pat- 
terns associated with increasing regional emphysema 
severity (P = 0.006) and the COPD-related expression pat- 
terns observed in Bhattacharya et al.[7] and Golpon et al 
[6] (P < 0.05). In addition, the gene expression effects of 
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Figure 4 Relation between gene expression changes associated with regional emphysema severity (Lm) and other gene-expression 
studies by GSEA. (a) Relation between gene expression changes associated with regional emphysema severity and those induced by TGF|3 
treatment of A549 cells from Malizia et a I. [23]. The color bar represents the fold change between the cell lines treated with and without 
TGF(31for 11,910 genes in Malizia et a I. [23]. Red indicates a more positive fold change and green indicates a more negative fold change (induced 
or repressed with TGFp, respectively). The vertical lines represent the position of genes associated with regional emphysema severity in the 
ranked gene list. The height of the vertical lines corresponds to the magnitude of the running enrichment score from GSEA. Blue vertical lines 
indicate that the gene is part of the 'core' enrichment (that is, all the genes from the absolute maximum enrichment score to the end of the 
ranking), (b) Supervised heatmaps of relative gene expression levels for the core enrichment genes in both the regional emphysema and Malizia 
et al. datasets (1 1 genes down-regulated with emphysema severity but up-regulated with TGFp). Each gene is represented in the same row 
across heatmaps. (c) Genes changing in expression with increasing regional emphysema severity were enriched in the cross-sectional study of 
COPD-related gene expression from Golpon et a I. [6]. The color bar represents the t-statistic from a t-test between five emphysema patients and 
five non-smokers for 5,209 genes in Golpon et a I. [6]. Blue and orange vertical lines indicate that the gene is part of the core enrichment, 
(d) Supervised heatmaps of relative gene expression levels for the core enrichment genes in both the regional emphysema and Golpon datasets 
(8 genes concordantly up-regulated; 19 genes concordantly down-regulated). 



GHK are similar to the effects of TGFp treatment 
observed by Malizia et al.[23] (P = 0.004). 

As the CMap examined the effect of GHK in cancer cell 
lines, we next sought to verify the effect of GHK treatment 
in a cell type more relevant to emphysema pathogenesis. 
We utilized human lung fibroblasts because fibroblasts are 
the major interstitial cell within the alveolar unit that can 
synthesize and remodel the ECM and previous studies 
have demonstrated that GHK can induce ECM production 
in dermal fibroblasts [32-34]. Human lung fibroblast cul- 
tures were treated with two concentrations of GHK or 
with TGFpi (see Additional file 1 for methods). Gene 
expression profiling of these cells demonstrated that the 
200 genes most induced by GHK at 1 [iM in cancer cell 
lines in the CMap dataset were enriched among genes that 
increased after treatment with GHK at 0.1 nM in 



fibroblast cultures (FDR <0.05, GSEA). Furthermore, 
genes whose expression is decreased with increasing 
emphysema severity are enriched among genes induced by 
GHK at 10 nM (FDR < 0.05, GSEA; Figure 5a,b). Genes 
whose expression is altered by GHK treatment at either 
concentration are also enriched among genes that change 
with TGFpi treatment (FDR <0.05, GSEA; Figure 5c,d). 
See Additional file 8 for the GSEA enrichment plots show- 
ing the relationship between GHK, TGFp, and emphysema 
severity signatures. 

Reversal of COPD-related phenotypes in fibroblasts 
by GHK 

Genes induced in human lung fibroblasts after treat- 
ment with GHK were enriched in actin cytoskeleton 
organization and focal adhesion pathways (FDR < 0.05, 
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Figure 5 Effect of GHK treatment on expression in human lung fibroblasts, (a) Genes decreasing in expression with increasing regional 
emphysema severity were enriched among genes that are induced by GHK at 10 nM. (b) Supervised heatmaps of relative gene expression 
levels for the core enrichment genes in both datasets (18 genes down-regulated with Lm but up-regulated with GHK). Each gene is represented 
in the same row across heatmaps. (c) Genes differentially expressed with treatment of GHK at 0.1 nM were concordantly enriched among genes 
that change with treatment of TGF(31. (d) Heatmap of relative gene expression levels for the core enrichment genes (1 18 genes up-regulated 
and 124 down-regulated with both GHK and TGF|31). 



DAVID). These included integrins involved in collagen 
attachment, such as ITGB1. ITGB1 gene expression was 
also down-regulated with increasing emphysema severity 
in lung tissue (P = 0.008). Resolution of damaged tissue 
requires mesenchymal cells to attach to collagen fibers 
through integrin-dependent mechanisms and generate 
mechanical tension via the actin cytoskeleton to promote 
tissue contraction and wound size reduction. Using distal 
lung fibroblasts isolated from former smokers with and 
without COPD, we found that GHK (10 nM), like TGFpi 
(10 ng/ml), induced alterations in integin-pi localization 
(green staining in Figure 6a) and reorganized actin to 
form contractile filaments (red staining in Figure 6a). We 
further demonstrated using a three-dimensional collagen 
gel contraction bioassay that distal lung fibroblasts 
derived from former smokers with COPD (n = 5) were 
unable to fully contract collagen I gels compared to fibro- 
blasts obtained from former smokers without COPD (n = 
5, P < 0.05; Figure 6b,c; see Additional file 12 for subject 
demographics), similar to what has been previously 
described [35]. However, fibroblasts derived from COPD 
patients first treated for 48 h with either TGFp 1 or GHK 
were able to induce full collagen I gel contraction com- 
parable to that observed in fibroblasts from former 



smokers without COPD (P < 0.01, Figure 6b,c). Using the 
second harmonic generation properties of fibrilar col- 
lagen and multi-photon microscopy, we confirmed that 
fibroblasts from former smokers with COPD were unable 
to efficiently remodel collagen into fibrils (Figure 6d). 
Importantly, following 48 h of treatment with TGFpi or 
GHK on lung fibroblasts from former smokers with 
COPD, we were able to restore this intrinsic defect, 
which we propose is through organization of the actin 
cytoskeleton to a contractile phenotype as demonstrated 
by the confocal images displayed in Figure 6a. 

Discussion 

The goal of this study was to identify gene expression 
changes associated with regional emphysema severity in 
order to elucidate biological processes underlying the pro- 
gression of emphysema and to identify potential COPD 
therapeutics. By measuring gene expression from regions 
of varying emphysema severity within the same lung and 
by using a morphologic measurement of airspace size 
(Lm), which reflects the degree of alveolar destruction, we 
were able to identify gene expression changes associated 
specifically with the emphysematous component of 
COPD. 
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Figure 6 Effect of GHK treatment on collagen contraction by fibroblasts from former smokers with COPD. (a) Representative 
immunofluorescent images of distal lung fibroblasts from former smokers with and without COPD treated with GHK (10 nM), TGF(3 1 (10 ng/ml), 
or media control for 48 h and stained with phalloidin to localize the actin cytoskeleton (red), integrin-(31 antibody (green) and DAPI to localize 
nuclei (blue), (b) Representative images of collagen I gel bioassays at 24 h after being seeded with distal lung fibroblasts from former smokers 
with and without COPD previously treated with GHK, TGF(31, or media control for 48 h. (c) The percentage of collagen I contraction was 
significantly decreased in fibroblasts derived from former smokers with COPD compared to former smokers without COPD (P < 0.05) but was 
significantly increased with addition of TGF(31 or GHK (P < 0.01). (d) Representative enface Z-stack slices of three-dimensional reconstructed 
collagen I gel bioassays demonstrating actin in fibroblasts (green, phalloidin) and second harmonic signal originating from collagen fibrils 
(purple, 414 nM). Fibroblasts from former smokers with COPD were unable to efficiently remodel collagen into fibrils. However, this intrinsic 
defect was restored with treatment of TGF(3 1 or GHK. 
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Interestingly, there was significant enrichment between 
genes differentially expressed in COPD or associated 
with worsening lung function in other datasets and those 
we found to be associated with regional emphysema 
severity. Importantly, this similarity supports the notion 
that regional differences in emphysema severity reflect 
the processes that occur with general COPD pathogen- 
esis and progression and are not only present in patients 
with end-stage disease. Overall, these observations sug- 
gest a similarity in the gene expression alterations that 
accompany airflow obstruction, gas exchange abnormal- 
ities, and alveolar destruction measured by Lm. 

A common characteristic in the pathology of COPD is 
progressive lymphocyte infiltration of the small airways 
and alveolar walls [36] . In addition, the formation of ter- 
tiary lymphoid organs within this infiltration suggests the 
presence of an adaptive immune response to persistent 
foreign or autoimmune antigens [37,38]. The present 
study extends these observations by showing that the 
expression patterns of several components of the B-cell 
receptor signaling pathway have increased expression in 
regions of severe emphysema. Iga (CD79A) and IgP 
(CD79B) are proteins that associate with the B-cell recep- 
tor and transmit its signal upon stimulation. Immunohis- 
tochemistry showed a significant relationship between the 
volume fraction of the airway wall and alveolar tissue posi- 
tively stained for CD79A and an increase in Lm. This rela- 
tionship supports an increased number of B cells in both 
airway wall and alveolar tissues and is consistent with the 
induction of CD79A during tissue destruction associated 
with the increase in Lm. 

The TGFp signaling pathway is involved in a variety of 
cellular processes, including immune response, extracel- 
lular matrix remodeling, angiogenesis, and cell differen- 
tiation. This pathway has also been implicated in a 
variety of diseases such as cancer and fibrosis [39]. It has 
been hypothesized that the TGFp pathway could play a 
role in COPD pathogenesis, but its role is not completely 
understood [40]. Togo et al. [35] found that fibroblasts 
isolated from COPD patients exhibited reduced chemo- 
taxis, reduced nuclear to cytoplasmic ratios of phos- 
phorylated SMAD3, and decreased a-smooth muscle 
actin production compared to controls when treated with 
TGFp. Decreased mRNA expression or protein levels for 
TGFpi, TGFBR1 [41], SMAD3 [42], SMAD6 [43], and 
SMAD7 [41,43] have been reported in more advanced 
stages of COPD or fibroblasts from COPD patients. In 
both alveolar and bronchiolar epithelium of emphysema- 
tous lungs, a decrease in phosphorylated SMAD2 has 
been shown by immunohistochemistry [44]. In normal 
human lung parenchyma, repair processes in response to 
mechanical injury are associated with increased TGFp 
signaling, while a decrease in expression has been 
observed for TGFp -related genes with worsening lung 



function in patients with COPD [25,45]. Furthermore, 
association studies have identified both promoter and 
coding region polymorphisms in the TGF/51 gene that 
associate with increased risk for COPD [46-48]. In the 
present study, we identified several components of the 
TGFp and BMP pathways that have decreasing expres- 
sion with increasing emphysema severity. In the BMP 
pathway, ACVRL1 and ENG are receptors involved in 
the phosphorylation of SMAD1 and are expressed in the 
mature lung vasculature. The changing expression of 
SMAD6 and SMAD1, their localization predominantly to 
vascular endothelial cells, and the roles of ACVRL1 and 
ENG in angiogenesis support the hypothesis of aberrant 
tissue remodeling in the lung vasculature during emphy- 
sema pathogenesis. In the TGFp pathway, TGFBR2 is a 
receptor involved in the phosphorylation of SMAD2/3 
and is important for many tissue remodeling processes, 
including wound repair. Moreover, genes found to be 
induced by TGFp in diverse studies were down-regulated 
in regions of severe emphysema. The localization of 
SMAD2 to alveolar and airway tissue and the decreased 
TGFp pathway activity seen with increasing emphysema 
severity support the hypothesis that a decrease in TGFp 
pathway activity also contributes to emphysema 
pathogenesis. 

As COPD remains a major public health concern due to 
lack of effective therapeutic strategies, we sought to use 
computational methods to identify compounds that might 
modulate molecular processes associated with emphysema 
pathogenesis. The CMap is a large compendium of micro- 
array experiments that measures the effect of over 1,000 
compounds on gene expression in several cell lines [13]. 
By querying a gene expression signature of disease patho- 
genesis against the CMap dataset, one can find com- 
pounds that elicit a pattern of gene expression that is the 
opposite to the disease-related gene expression profile. 
This can lead to the hypothesis that such compounds, 
since they reverse the disease-related gene expression pat- 
tern, are potential therapeutics for that disease. This 
approach has been recently successful in the therapeutic 
repositioning of the antiulcer drug cimetidine to lung ade- 
nocarcinoma and the anticonvulsant drug topiramate to 
inflammatory bowel disease [49,50]. In these studies, sig- 
natures for each disease were derived using several pub- 
licly available gene-expression datasets and queried in the 
CMap. Candidate compounds or drugs that could signifi- 
cantly reverse the disease-related signatures of gene 
expression were further validated in vitro, showing that 
this computational method is a viable approach for identi- 
fying novel therapeutics. 

Using the CMap dataset, we identified a relationship 
between the gene expression changes induced by the tri- 
peptide GHK and those that are repressed with increasing 
emphysema severity. Intriguingly, we further found that 
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GHK-treatment induced a pattern of gene expression 
similar to that resulting from TGFp pathway activation. 
We replicated both of these findings in human lung fibro- 
blasts, which are the major interstitial cells that maintain 
tissue structural integrity by sculpting the connective tis- 
sue. GHK-Cu is a natural tripeptide that, in human 
plasma, can be found at a concentration of 200 ng/ml at 
the age of 20 years but drops to around 80 ng/ml by the 
age of 60 years [34]. Characterization GHK-Cu in skin 
wound repair models suggests that it induces wound con- 
traction, cell proliferation, angiogenesis, and increased 
expression of antioxidant enzymes and integrins [34,51]. 
Direct evidence for the ability of GHK-Cu to promote 
wound healing comes from experimental rat models 
where GHK treatment causes an acceleration of healing 
and a concentration-dependent increase of connective tis- 
sue and other ECM components [32,33]. These effects are 
consistent with the gene expression alterations induced by 
GHK and TGFp treatment. Moreover, we confirmed these 
similarities by demonstrating that GHK and TGFp 
induced significantly higher expression and re-organiza- 
tion of actin and integrin-P 1 in distal lung fibroblasts. 

We further assessed the ability of GHK and TGFp to 
induce tissue contraction. As in previous studies [35], 
we demonstrated that distal lung fibroblasts derived 
from COPD patients have intrinsic defects in collagen I 
contraction compared to fibroblasts derived from former 
smokers without COPD. When fibroblasts from COPD 
lungs were treated with GHK or TGFp contraction and 
remodeling of collagen gels was induced to levels com- 
parable to fibroblasts from former smokers without 
COPD. We further demonstrated that the collagen con- 
traction induced in COPD fibroblasts by GHK involves 
the organization of collagen I gels into collagen fibrils 
using multi-photon microscopy. Taken together, these 
data further support the hypothesis in which a wound- 
healing-like process is diminished as a function of 
emphysema progression and further suggest that this 
process is related to the TGFp pathway. 

While the number of subjects in this study for genomic 
analysis was small, the analysis of eight specimens per 
lung representing different degrees of emphysema from 
each individual allowed us to detect gene expression 
changes specifically associated with regional emphysema 
severity. We further demonstrated that these genes are 
concordantly differentially expressed in previous cross- 
sectional studies involving larger numbers of individuals 
with varying degrees of airflow limitation. These results 
validate the gene expression differences associated with 
regional emphysema severity in independent cohorts 
from different clinical settings and support the hypoth- 
esis that the genes whose expression is associated with 
regional emphysema severity reflect the activity of true 
disease-associated processes. As demonstrated by our 



micro-CT data, COPD is a heterogeneous disease within 
the lung [12]. Further studies will be required to assess 
whether COPD-associated differences in ECM remodel- 
ing by distal fibroblasts in vitro is associated with the 
regional disease severity in the tissue from which the 
fibroblasts are derived. 

Conclusions 

This study has provided insights into molecular pro- 
cesses associated with emphysematous destruction of 
the lung and revealed mechanisms that contribute to 
the pathogenesis of COPD. Whole genome gene-expres- 
sion analysis supports the role of the immune response 
in regional emphysema and elucidates additional path- 
ways involved in the process of emphysematous destruc- 
tion. The suggestion that progressive emphysematous 
destruction is associated with down-regulation of genes 
involved in or downstream of tissue remodeling and 
wound repair pathways supports a role for defects in 
ECM homeostasis and angiogenesis in the emphysema- 
tous destruction that occurs with chronic inflammation 
in COPD. We propose that these processes could be 
linked through decreased TGFp pathway activation. 
These data are supported by our identification of GHK 
as a compound with the potential to mimic TGFp path- 
way activity and induce collagen contraction, an impor- 
tant functional component of wound repair. 

Additional material 



Additional file 1: Supplementary methods. 

Additional file 2: Statistical results for gene expression analysis. 

Additional file 3: Functional categories enriched among genes 
associated with regional emphysema severity. 

Additional file 4: Gene expression relevance network. Dark blue 
circles are genes that have expression significantly correlated with Lm; 
light blue circles are all other genes. Edges are indicated by green 
(positive correlation) or red (negative correlation) lines. 

Additional file 5: RT-PCR validation of 14 genes associated with 
regional emphysema severity. 

Additional file 6: Confirmation of gene expression changes 
associated with regional emphysema severity (Lm) within individuals 
with emphysema using GSEA. Genes associated with unsealed Lm 
measurements identified using all eight patients in the analysis are 
concordantly enriched among genes associated with scaled Lm 
measurements (Z-scored within each patient) using only the five 
emphysema patients (FDR <0.001). These results demonstrate that the 127 
gene signature is related to regional emphysema severity within individuals 
and not to differences between donors and COPD patients or to differences 
in levels of emphysema between COPD patients. Orange and blue color 
bars represent the t-statistics from correlations of gene expression with Lm. 
The vertical black lines represent the position of genes in the gene set 
among the ranked gene list. The length of the black lines corresponds to 
the magnitude of the running enrichment score from GSEA. 

Additional file 7: Relation between gene expression changes 
associated with regional emphysema severity (Lm) and studies of 
TGFp-related gene expression using GSEA. Genes associated with Lm 
are enriched among the genes that are differentially expressed in response 
to TGFp treatment in datasets from (a) Classen et a/.[20], (b) Koinuma et al. 
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[22], and (c) Malizia et a I. [23]. (d) Genes most induced by TGFp in seven 
studies [19-23,30,31] are enriched among the genes that are associated with 
Lm. Orange and blue color bars represent the t-statistics from correlations 
of gene expression with a continuous variable. Red and green color bars 
represent the fold change between samples treated with and without 
TGFf3. The vertical black lines represent the position of genes in the gene 
set among the ranked gene list. The length of the black lines corresponds 
to the magnitude of the running enrichment score from GSEA. Enrichments 
with an FDR q-value <0.05 were considered significant. 

Additional file 8: Relation between gene expression changes 
associated with regional emphysema severity (Lm) and gene 
expression changes that occur with treatment of GHK or TGFp in 
fibroblast cell lines using GSEA. (a) Genes increasing in expression in 
response to treatment with GHK or TGFp are enriched among genes that 
decrease with increasing emphysema severity, (b) Genes differentially 
expressed with TGFp treatment or in response to GHK in the 
Connectivity Map are enriched among genes that change in expression 
with GHK (0.1 nM) in fibroblast cell lines, (c) Genes that are differentially 
expressed with TGFp treatment or that are down-regulated with 
increasing emphysema severity are enriched among genes that change 
in expression with GHK (10 nM) in fibroblast cell lines, (d) Genes that are 
differentially expressed in response to GHK are concordantly enriched 
among genes that change in expression with TGFp treatment in 
fibroblast cell lines. Orange and blue color bars represent the t-statistics 
from correlations of gene expression with a continuous variable. Red and 
green color bars represent the t-statistic between treated and untreated 
samples. The vertical black lines represent the position of genes among 
the ranked gene list. The length of the black lines corresponds to the 
magnitude of the running enrichment score from GSEA. Enrichments 
with an FDR q-value <0.05 were considered significant. 

Additional file 9: Localization of members of the TGFp superfamily 
using immunohistochemistry. Representative images of positive 
SMAD2 staining (arrows) in the (a) alveolar and (b) small airway wall 
tissue, (c) Representative image of positive SMAD6 staining in vascular 
endothelial cells (arrows) and macrophages (arrowheads), (d) 
Representative image of weak SMAD1 staining in vascular endothelial 
cells (arrows). Representative images are shown for control IgG staining 
in the (e) alveolar wall tissue, (f) airway wall tissue, and (g) blood vessels. 
Scale bar = 200 urn. 

Additional file 10: Relation between gene expression changes 
associated with regional emphysema severity (Lm) and cross-sectional 
studies of COPD-related gene expression using GSEA. Genes associated 
with Lm are enriched among the genes found to associated with the 
presence of COPD or degree of airflow obstruction in datasets from 
(a) Golpon et a/. [61 (b) Spira et alffl, (c) Wang et a/.[10], and (d) Bhattacharya 
et al{7\ (e) Genes previously found to be associated with COPD-related 
clinical variables [6,8-10] are enriched among the genes associated with Lm. 
Orange and blue color bars represent the t-statistics from correlations of gene 
expression with a continuous variable. Red and green color bars represent the 
t-statistic from a r-test between cases and controls. The vertical black lines 
represent the position of genes in the gene set among the ranked gene list. 
The length of the black lines corresponds to the magnitude of the running 
enrichment score from GSEA. Enrichments with an FDR q-value <0.05 were 
considered significant. 

Additional file 11: Examples of genes associated with Lm and 
validated by quantitative RT-PCR that were also differentially 
expressed in other COPD-related gene-expression datasets. Genes 
such as ACVRL1, SMAD6, CCR7, and CXCL13 were associated with increasing 
regional emphysema severity and concordantly differentially expressed in 
other datasets such as Golpon et a I. [6] and/or Wang et al.[\0]. 

Additional file 12: Subject demographics for lung fibroblast cultures. 
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